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ABSTRACT 

We present a new statistic — the redshift dispersion — which may prove useful 
for comparing next generation redshift surveys (e.g., the Sloan Digital Sky Survey) 
and cosmological simulations. Our statistic is specifically designed for the projection 
of phase space which is directly measured by redshift surveys. We find that the 
redshift dispersion of galaxies as a function of the projected overdensity has a strong 
dependence on the cosmological density parameter f2. The redshift dispersion statistic 
is easy to compute and can be motivated by applying the Cosmic Virial Theorem to 
subsets of galaxies with the same local density. We show that the velocity dispersion 
of particles in these subsets is proportional to the product of $1 and the local density. 
Low resolution N-body simulations of several cosmological models (open/closed CDM, 
CDM+A, HDM) indicate that the proportionality between velocity dispersion, local 
density and Q holds over redshift scales in the range 50 km s _1 to 500 km s _1 . The 
redshift dispersion may provide an interesting means for comparing volume-limited 
subsamples of the Sloan Digital Sky Survey to equivalent N-body/hydrodynamics 
simulations. 



Subject headings: cosmology: observations — cosmology: theory — methods: data 
analysis — methods: numerical — surveys 



1. Introduction 

Many statistical measures have been developed to distinguish between the various cosmological 
models, ranging from direct measures of the power spectrum and correlation function from redshift 
surveys, to measures of velocity dispersion, bulk flows, and Mach number from peculiar velocity 
surveys (cf., |Strauss fe Willick 1995 for a review). Each of these measures is designed to be 
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sensitive to different aspects of various models, such as f2, the initial power spectrum, or the 
Gaussian character of the initial phase distribution. 

One can get an intuitive understanding of the dependencies of different statistics by 
considering the following example. Compare N-body simulations of four different cosmological 
models: standard CDM, open CDM, CDM + A, and a pure HDM model, the details of which are 
given in §3. Each simulation is normalized to have the same value of eg, the standard deviation of 
the mass fluctuations within an 8 h Mpc sphere, where h is the Hubble parameter in units of 100 
km s _1 Mpc -1 . The two point correlation function, £(r), for each model is presented in Figure 
la. Despite their differences in initial power spectra, the final correlation functions are similar in 
all models. In general, the correlation function does not depend strongly on f2 if one retains the 
freedom to set the a% normalization. In contrast, Figure lb shows the pairwise velocity dispersion, 
(v 2 )(r), as a function of separation for the same set of models. A strong differentiation between 
the low and high f2 models is apparent. 

The velocity dispersion is easy to compute in simulations, but is difficult to measure in the 
geometric projection of phase space that is measured by redshift surveys. The traditional way to 
measure the small-scale velocity dispersion of galaxies is via the anisotropy it introduces in the 
redshift-space two-point correlation function ( Pavis &: Peebles 1983 ; Fisher et al. 1994 : Marzke et 



al. 1995 ; Loveday et al. 1996| ). However, as it is a pair-weighted statistic, it is dominated by the 



densest regions, which are necessarily rare. Thus one finds large variance between estimates of 



the small-scale velocity dispersion between different samples (|Mo, Jing, fe Borner 1993; Zureket 



al. 1994 ; Guzzo et al. 199G[ ; [Somerville, Davis, fe Primack 1997] ), indicating that the small-scale 



velocity dispersion measured in this way is not very robust. Attempts have been made to use 



direct measures of peculiar velocities of galaxies as a constraint on (v ) (e.g., Strauss, Cen, fc 



Ostriker 1993, Willick et al. 1997| ), but other than the nearby universe, where the velocity field is 



observed to be very quiet ( (Bandage 1986 ; Brown & Peebles 1987; Burstein 1990), the errors on the 



individual peculiar velocity measurements swamp the signal from (v }. 

The primary goal of this paper is to present a statistic — the redshift dispersion (cr z ) — that 
captures (v 2 ) in a way which is naturally applied to volume limited samples drawn from redshift 
surveys. In the subsequent sections we motivate and present our redshift dispersion statistic. 
Again, because it is a pair- weighted statistic, computing (v 2 ) by averaging over all densities results 
in a value heavily weighted by the densest regions. However, this problem can be alleviated if the 
dispersion can be calculated as a function of density. The theoretical motivation for our statistic 
stems from the Cosmic Virial Theorem (CVT), derived in Peebles (1976a, hereafter P76), which 
relates (v 2 ) to Q, and £. In §2 we show that the CVT can be applied to subsets of particles in 
a system which correspond to surfaces of constant density. Many assumptions are necessary to 
obtain the results shown in §2, not all of which are obvious. In §3 we explore the relationships 
derived in §2 with simple N-body simulations, suggesting that the results of §2 hold over a wide 
range of scales. The redshift dispersion is entirely independent of the results of §2 and §3, which 
provide a context for the redshift dispersion that would otherwise be an unmotivated simulation 
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based statistic. §4 describes how to compute the redshift dispersion from a redshift survey. §5 
contains our conclusions and remarks on future work. 



2. Velocity Dispersion on Surfaces of Constant Density 

As will be shown in §4, the redshift dispersion probes the pairwise peculiar velocity dispersion 
in regions of different density and is designed for comparing simulated and observed redshift 
surveys. In this section we attempt to provide a theoretical context and some motivation. 

One of the main challenges of working with the pairwise velocity dispersion arises from its 
strong density dependence. Intuitively, both the number of galaxies and the velocity dispersion 
should be highest in the densest regions. Thus, averaging over all densities will give results 
dominated by rare, high density peaks. This problem can be eliminated if the dispersion is 
calculated as a function of density. 

In the case of clustered, gravitationally interacting particles there exists a scaling relation 
between the velocity dispersion on small scales {v 2 )(r) averaged over pairs separated by a distance 
r, and the two point correlation function £(r), the excess fractional probability of finding two 



particles with separation r. This result is contained in the CVT derived in P76 (see also Peebles 



1976b; Davis & Peebles 1977; Peebles 198C), which can be written 



(v 2 ) (r) oc f}£(r)r 2 . (1) 

Several assumptions are used to obtain Eq. (1) (see P76), including that £ is given by a power 
law £(r) = (ro/r) 7 ; r <C ro, implying £(r) S> 1; the three-point correlation function £ is given 
by a symmetrized product of the two-point correlation function (see Eq. [A23]); and the mass 
of each galaxy is concentrated on scales smaller than their separation. The validity of these 
assumptions is not obvious ( [Fisher et al. 1994 ). For an excellent discussion of the implications 



of extended dark matter halos on the CVT, see Bartlett & Blanchard (1996). In addition, we 
are assuming that the galaxy velocity field is unbiased with respect to that of the dark matter. 
While theoretical investigations have given strong suggestions that a velocity bias of order 20-30% 
may exist ( Couchman fc Carlberg 1992 , Evrard, Summers, fc Davis 1994 , pelb fc Bertschingei 



1994 ) , the difficulty in reliably tracing galaxies in simulations has prevented a good estimate of its 



magnitude (Summers, Davis, & Evrard 1995). For the motivational purposes of this paper, let us 



keep these assumptions as we work to derive the density dependence of the CVT. 

To rewrite Eq. (1) in terms of the density requires some new notation. Let (, a xb( r ) denote the 
cross-correlation of particle sets a and b. The two-point (auto) correlation function of all particles 
in a system V can be written ^■p y p{r). The correlation function can also be written as the average 
of the individual cross correlations 

*M = ^£&<p(r), (2) 
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where N-p is the total number of particles. We can write down a similar expression for the velocity 
dispersion 

<^ 2 >(0 = ^£<^>M> (3) 

where ("V^p)(r) is the variance of the pairwise velocity between particle i and all particles in 
V which lie a distance r from i. We show in Appendix A that these expressions lead to a 
generalization of the CVT that holds for each particle: 

(^)(r)«^ xP (r)r 2 . (4) 

If the CVT holds for each particle, then it holds for any subset, S C V: 

(4xp)W«^5xp(r)r 2 , (5) 

where 

^SXp)iT) = -fr E <t&P>(»0, (6) 
5 iescv 

and 

Zs«p(r) = — E (7) 

Let Ni{r) be the number of particles within a radius r of the ith particle, and <S n the set of all 
particles for which iVj = n. For this subset, the CVT is 

{v 2 )(n,r) = C 1 nan,r)r 2 , (8) 

where (v 2 )(n,r) = (u| nXP )(r), £{n,r) = £s n xp(r) and d is given by Eq. (A32). The quantity 
£(n, r) is related to the expected number of particles within a radius r around any particle by 

v f \l + £(n, r')]47rr /2 dr' = n, (9) 
Jo 

where v is the mean number of particles per unit volume. If £(n,r) oc r~ 7 and £(n,r) S> 1 (as 
assumed in the original P76 derivation), the above expression yields 

3 

£(n, r)n(r) = n, (10) 
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where n{r) = is^irr 3 . 

Inserting the above expression into Eq. (8) yields 

(v 2 )(n,r) = C^nn/nir) 1 / 3 , (11) 

where C 2 = (3 - 7)/3(4tt^/3) 2 / 3 . The above expression shows that the pairwise velocity dispersion 
is proportional to O and the local density (through n) smoothed on a scale r. Finally, since we 
are working with iVj (r) , which is the number of particles within a volume of radius r centered on 
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a particle, it is convenient to work with a similar velocity dispersion. Let cr 2 (r) be the average 
pairwise velocity dispersion of all the particles within a volume of radius r. cr 2 (r) can be obtained 
by integrating (v 2 )(r) under the same assumptions used to derive Eq. (1) 

where C 3 = (3 - 7)/(5 - 27). Substituting Eq. (12) into (11) results in 

<j 2 v {n,r) (xttn/n{r) l / ? > (13) 

where the constant of proportionality is equal to 

Crr = 30QM 7 (3- 7 ) 2 

1 2 3 4(7 -l)(2 -7)(4-7)(5 -2 7 )(47rP/3) 2 / 3 ' 1 ' 

and Q an d M 7 are defined in Appendix A. 

The above derivation of Eq. (13) holds whether or not the quantities are averaged over one or 
many particles with the same density. The above velocity dispersion is computed with respect to 
the velocity of the central particle, v. The velocity dispersion can also be computed with respect 
to the mean velocity of the particles in a cell, u. In a given cell, these two values of the velocity 
dispersion will differ by |u| 2 — |v| 2 . The distribution of these differences for all cells in S n will 
peak at zero and have a width on the order of a 2 . Subsequent averaging over many cells with the 
same density will result in zero net difference. Thus, Eq. (13) also applies for velocity dispersions 
computed with respect to the mean velocity in the cell if the results are averaged over many cells 
in S n . 

Eq. (13) conveniently relates two readily computable quantities: the velocity dispersion with 
respect to the mean velocity in a cell of radius r to the number of particles in the cell, which 
is proportional to the density. In the next section, we explore the above form of the CVT with 
N-body simulations. 



3. N-Body Results 

The previous section presented a derivation for a relationship linking the velocity dispersion 
to the local density. In this section we explore the range over which Eq. (13) holds using N-body 
simulations of specific cosmological models with different initial power spectra. We are particularly 
interested in the dependence on Q when the number of objects is of the same order as expected 
from volume limited redshift surveys. 

The simulations we consider are designed to probe a variety of popular cosmological models. 
The four models are: standard CDM with Q = 1, h = 0.5; HDM with Q = l,h= 1.0; open CDM 
with n = 0.35, h = 0.7; and CDM + A with Q C dm = 0.35, ft A = 0.65, h = 0.7. The open CDM 
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and CDM + A models provide two alternatives to standard CDM that increase the ratio of large 
scale to small scale power. They differ in evolution in that structure formation "freezes out" at an 
earlier epoch in the open model as the expansion rate exceeds the rate of gravitational collapse. 
Thus, to achieve the same level of structure today, collapse must begin earliest in the open CDM 
model, later in the CDM + A model, and latest in standard CDM model. While HDM is not 
generally considered a viable theory, it provides a significantly different power spectrum shape 
with which to explore our ideas. 

The initial conditions are designed to treat the models, as much as possible, on an even 
footing. All models assume a Harrison-Zel'dovich primordial power spectrum, and use the same 
random phases for the Fourier modes to generate the initial density field from their respective 
power spectra. The CDM transfer functions are taken from Efstathiou, Bond, & White (1992, 
Eq. [7]) with the parameter T = QcDMh. Although this function was not intended for use 
in open models, it fits more detailed calculations to within 5% (D. N. Spergel 1995, private 
communication). The HDM transfer function is taken from Holtzman (1989, Table 2A, line 52). 
As stated in §1, each model is normalized to have the same linear value of erg = 0.67, so as 
to provide similar correlation strengths and to isolate out the velocity dependencies. Although 
this normalization does not match that predicted from the observed fluctuations in the Cosmic 



Microwave Background for some or all of the models (c.f., Stompor, Gorski, &z Banday 1995 ; 



Gorski et al. 1995), it roughly equalizes the amount of power on the scales where this paper is 



focused. 

Each of the simulations follows 32 3 = 32, 768 dark matter particles within a periodic cube 
of comoving size 100 /i _1 Mpc (10,000 km s _1 ) on a side. The P3MG3A code (Brieu, Summers 



fe Ostriker 1995), which implements the P 3 M algorithm (Hockney & Eastwood 1981; Efstathiou 



et al. 1985) on the GRAPE-3A hardware board (Okumura et al. 1993), is used to evolve the 



simulations from redshift z = 23 to z = using 1200 time steps. A Plummer force law with 
softening parameter of 156 h~ 1 kpc is used for the gravitational interactions and the mass per 
particle is 8.5 x 10 12 Uh^ 1 Mq. Each simulation took approximately 2 hours to run on a Sun 
Sparc 10/51 workstation with a 4 chip GRAPE-3A board. 

The ideal way to construct an artificial galaxy catalog is by identifying concentrations of 
gaseous and stellar material in high resolution N-body/hydrodynamic simulations. Current 
computer technology and algorithms now permit such simulations on the scale of small groups 



of galaxies ( Evrard, Summers, fc Davis 1994 ). However, at the present time the realm of large 
volumes remain the domain of strictly gravitational N-body codes. Identifying galaxies from 
dark matter halos is a significant problem that may not be solvable ( [Summers, Davis, fc Evrard 



1995 ); although a variety of of impressive methods have been developed ( Efstathiou et al. 1988 



Bertschinger fc Gelb 1991] ) . However, to keep things as simple as possible we choose a model 



in which mass traces light, and each particle is assumed to be a galaxy. This approach neglects 
important processes, such as mass and velocity bias, the dependence of bias on cosmological 
models, and the interaction of dark matter halos. Nevertheless this approach should be sufficient 
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for our purely motivational purposes. Future simulations which can both cover large volumes and 
resolve galaxies hydrodynamically will hopefully clarify the nature of the bias. 

To extract the velocity dispersion of a particular surface of constant density, consider a set of 
particles where Xj and v, are the position and velocity of the ith particle. Place spherical cells of 
radius r on a uniform grid over the entire domain. Let Nj be the number of particles in cell j. 
The correct way to compute the velocity dispersion in a cell is with respect to the mean motion of 
the particles. As was argued in §2, Eq. (13) will apply if the results are averaged over many cells 
with the same density. Denote the mean velocity in the jth cell by u.,; the velocity variance, a 2 , is 
then |vj — Uj| 2 averaged over all particles in the cell. If <S n is the set of all cells having n particles 
(i.e., Nj = n), then the average velocity and variance as a function of n is 



Pv(n,r) = — N- ( 15 ) 

and 



*^)vE4 ( 16 ) 

where N$ n is the number of particles in the set S n (Note: N$ n ^ n). These equations provide a 
specific prescription for computing the density dependence of the mean velocity and the velocity 
dispersion, which can readily be applied to the simulations. 

Plots of fi v (n,r) and a v (n,r) are shown in Figure 2 for the four models. The cell size is 
r = 194 km s -1 , corresponding to n(r) = 1. All distances are expressed in units of km s _1 . 
Figure 2 demonstrates three important points: both fi v and a 2 are independent of the shape of 
the initial power spectrum; /j, v is independent of the local density, while a 2 is proportional to the 
density; and fi v and a 2 have the same strong SI dependence that (v 2 ) demonstrated in Figure lb. 

Formally, the power spectrum independence is explained by the derivation of the CVT 
(see Appendix A). The velocity dispersion depends upon the evolved power spectrum, which 
is essentially indistinguishable between models (Figure 1). Moreover, any remaining difference 
between models is encoded in the density distribution function, which is not apparent when 
density is the dependent variable in Figure 2. 

To explore the range over which Eq. (13) is valid, fi v and a 2 were calculated over the range 
of cell sizes 77 km s -1 < r < 488 km s" 1 , corresponding to 2 -4 < n(r) < 2 4 . We obtained the 
following empirical fit for the mean velocity magnitude 

H V oc VL l l 2 /n a . (17) 

where a ~ 0.05. As fi v already contains the desired Q dependence, it is convenient to represent a 2 
in terms of the normalized velocity dispersion a 2 /^ 2 ,, with the resulting fit 

^/^ocn/n 1/3 , (18) 
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which agrees with the theory to within the small factor n . 

The quality of the fits can be observed by plotting ji v and a% / nl against the scaled density 
(n/n 1 / 3 ), for each value of r. Figure 3a shows the ratio of fi v to the fitted value computed from 
Eq. (17) for the Q = 1 and f2 = 0.35 CDM models. Each line of data corresponds to a different 
value of r, and has been offset from the next by one unit. Figure 3b shows a similar plot for the 
normalized velocity dispersion o 2 z j$ u . The solid lines are the best fits given by Eq. (18); this 
has not been divided out. The larger scatter at higher densities and smaller scales is due to the 
small number of cells contributing to the calculation at these values. The key point to observe 
from Figure 3a is how well the data points follow the horizontal lines corresponding to their fitted 
values. 

The fits to the data shown in Figure 3 are remarkable, considering that the range includes 
cells with underdensities (n/n — 1) of —0.8 on scales of 488 km s _1 and cells with overdensities of 
nearly 200 on scales of 77 km s _1 . Furthermore, the scaling of a 2 is almost exactly that derived 
from the CVT, indicating that Eq. (13) holds over a large range of scales and densities, even when 
i(r) ~ 1. 



4. Redshift Dispersion 

The purpose of §2 was to theoretically motivate the local density and Q dependence of a%. In 
§3 the scaling relations in §2 were explored with simple N-body simulations. In addition, §§2 and 
3 have introduced the ideas which allow us to describe the main point of this paper — the redshift 
dispersion. 

The formalism we have developed so far cannot be directly applied to observations due to the 
geometric projection of a six dimensional phase space into a three dimensional redshift survey. The 
redshift of galaxies represents the only probe, albeit indirect, of the peculiar velocity. Thus, any 
statistic that desires to take advantage of the properties of a% must be defined with the specific 
geometry of redshift space in mind. In this section we now describe the redshift dispersion, a 
statistic with the aim of being readily measurable from volume limited samples taken from redshift 
surveys and which captures the essence of a% . 

Now let us define quantities analogous to those in §3, but for redshift space. Consider a 
volume limited sample from a survey out to a maximum redshift of Z. Each data point consists of 
two angular coordinates on the celestial sphere and a redshift. Let us define cells within which to 
measure density and velocity dispersion in projection on the celestial sphere. The cell j consists 
of a cone emanating from the origin with solid angle ttO 2 around the cell center. The number of 
points in the cone j is Nj and is proportional to the projected density on the sky. The mean and 
the variance of the redshifts in the cone are denoted Uj and a"j , which are depicted schematically 
in Figure 4. If S n is the set of all cones with Nj = n, then the average and variance of the redshift 
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as a function of n is 



H z (n,6) = 



1 



Z u 3 



(19) 




and 



a 2 z (n,e) = 



1 



(20) 




j€<S, 



in analogy with Eqs. (15) and (16). 

The efficacy with which the a z statistic might distinguish between models is examined with 
the simulations discussed in §3. The simulations were transformed into redshift-space using 
Z = 5000 km s _1 , which is equal to one half of the simulation box size. Since the data are 
periodic, the origin can be placed at any point, allowing multiple perspectives to be drawn from 
a single simulation. We computed a z for f2 = 1.0 and O, = 0.35 CDM models. The angular cell 
size was = 1.8° (n(0) = 2), and the cell centers were computed by creating a pseudo-uniform 
grid of points across the celestial sphere (Baumgardner & Prederickson 1985). The data have been 
averaged over 27 different origins regularly distributed throughout the domain, and error bars 
computed from the standard deviation of this averaging. 

The a z curves are plotted in Figure 5. As with the velocity dispersion in Figure lb, the 
redshift dispersion shows a strong separation between the low and high O, models. The differences 
become apparent at moderate angular over-densities (6 ~ 5). The shape of the curves in Figure 5 
is due to the combined spatial and peculiar velocity contributions to a z , as is illustrated in 
Figure 6 for the S7 = 1 CDM simulation. We know the full six-dimensional position of each galaxy 
in phase space in the N-body simulation, and thus can separate these two contributions. At low 
overdensities, a z is dominated by the spatial separation of the particles, as is indicated by the solid 
points in Figure 6; the spatial component scales approximately as n -1 / 2 , due to the more tightly 
bound nature of denser systems. At higher overdensities the peculiar velocity dominates, scaling 
approximately as n 1 / 2 , which is consistent with results of §2 and §3. For smaller values of f2 the 
spatial component behaves the same, but the peculiar velocity component is down by a factor of 



These results indicate that the greatest separation in the dispersion between models with 
different values of £1 will occur in the denser regions; so it is important that we sample many 
modestly dense regions, which requires a large volume. In addition, to minimize projection effects 
the sample should not be too deep (i.e., Z not too large). Therefore, to apply the a z statistic 
requires a dense sample over a wide field. If believable simulations of galaxies can be developed it 
might be possible to constrain models with the same final correlation function by varying £1 in the 
simulations and finding the best fit to the observations. For any one point, Figure 5 indicates an 
error of about 0.15 in Q. Using many points along the curve, the errors may be small enough to 
significantly constrain the value of Q. 

The current redshift surveys do not have enough data to attempt such a comparison. To 
get the level of separation shown in Figure 5 required O(10 4 ) particles. We have calculated a z 



n 1 / 2 . 
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from the IRAS 1.2 Jansky survey ( Fisher et al. 1995| ), but a volume- limited sample taken from 



this survey contains only 800 galaxies at best. The error bars from an 800 point sample in the 
simulations were too large to be able to distinguish between high and low Q models. 

Fortunately, a substantial increase in the amount of data available will be brought about by 
the Sloan Digital Sky Survey (SDSS). The SDSS will obtain spectra and measure the redshifts 
of the 10 6 galaxies down to r' ~ 18 flGunn fc Weinberg 1995 ). We can estimate the size of a 



volume-limited sample taken from the SDSS using the Schechter luminosity function <p s fitted to 
the Stromlo-APM survey ( Loveday et al. 1992| ) 



4> s (L)dL = <P*y a e-ydy, y = L/L\ (21) 

where <f>* ~ 1.4 x 10~ 8 km~ 3 s 3 , M* ~ —19.5, and a ~ —0.97 are parameters obtained from the fit. 
The number of galaxies in a given volume, V, brighter than Lq, N(V,Lq), can be computed by 
integrating the luminosity function 

/*oo /*oo 

N(V, L ) = V <p(L)dL = V / 4>*y a e-ydy = V<f>*T(a + 1, y ), (22) 

where yo = Lq/L*. For a volume limited survey, Lq is the luminosity an object would have if it 
had an apparent magnitude tuq and was located at the volume edge Z (Hq = 100 km s _1 Mpc -1 ). 
From this definition it follows that 



ZL s -ilO°- 4(M *~ m ° +15) , (23) 



where mo is the apparent magnitude limit of the survey. The Stromlo-APM survey was taken in 
the bj band, while the Sloan will use the r' band. The two can be equated by the approximate 
relation bj ~ r' + 1. However, we set thq = 18.7 which gives a better value for the estimated total 
number of galaxies in the survey (10 6 ). Inserting Z = 5000 km s _1 into Eq. (23) gives yo ~ 0.013. 
Setting Vsdss = ftZ 3 /3 results in N$dss ~ 7000. This number might in fact be appreciably 
larger if there are many more faint galaxies than Eq. (22) implies, and has been suggested by 
recent surveys for low-surface brightness galaxies (cf., palcanton 1995| ). However, these galaxies 
are unlikely to have redshifts measured as part of the SDSS. 

One could substantially increase the number of galaxies in a volume-limited survey of fixed 
depth by dropping the requirement that it include the origin. Figure 7 plots the expected number 
of galaxies in a series of volume limited shells for SDSS and shows that a redshift shell between 
25,000 km s _1 and 30,000 km s _1 would include almost 10 5 galaxies! 

Ultimately, applying the redshift dispersion statistic to the SDSS could provide many data 
points for comparing with simulations and perhaps constraining 17. The a z curves can be evaluated 
for many angular sizes and redshift shells for both the SDSS and for several next generation, high 
resolution, N-body/hydrodynamic simulations with different values of Q. Since the number of 
estimated objects in the SDSS is the same order or more as the simulations shown in Figure 5, 
one might expect to obtain an equivalent separation between models for each value. It should be 
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interesting to compare a z measured in different simulations with the SDSS as a function of n, 8 
and shell geometry. 

5. Conclusions 

Our goal has been to present a new statistic — the redshift dispersion (a z ) — that is sensitive 
to Q and is well suited for comparing real and simulated volume limited samples from redshift 
surveys. Given the proper data, a z is easy to compute and can be applied on many scales without 
applying arbitrary assumptions. We have used low resolution simulations, which are sufficient 
for our motivational purposes, to do a simple exploration of a z , which suggests that applying it 
to the SDSS may be worthwhile. In addition, with the right simulations, it might be possible to 
constrain f2 in models where the simulations match the observed final correlation function. 

We have shown that the pairwise velocity dispersion is intrinsically related to the local density 
and 0. In §2 and Appendix A it is shown that the CVT holds for each particle in a system 
and subsequently for any subset of particles, if we are careful to define the velocity dispersion 
and correlation function for these subsets appropriately. The density dependence of the velocity 
dispersion can be extracted by looking at subsets of particles of a given local density (see Eq. 13). 
Knowing how a 2 depends upon the local density gives an indication as to why the standard 
approach of averaging (v 2 ) over all densities is highly sensitive to the presence of rare density 
peaks. Exploring Eq. (13) with N-body simulations of several cosmological models indicates that 
it holds over a wide range of length scales, 77 km s _1 < r < 488 km s . Our redshift dispersion 
statistic is simply the redshift-space analog of the quantity a v . 

We have treated our galaxies as equal mass particles containing all the mass, ignoring the 
hypothesized global stochastic mapping from the dark matter mass and velocity distribution to 
the distribution of galaxies (i.e. the mass and velocity bias functions), which may differ among 
the models. A more general treatment would estimate the mass and velocity bias in each model 
and normalize the models such that the galaxy correlations, not the dark matter correlations, are 
similar. At best, reliable estimates of these bias functions await the next generation of simulations 
where galaxies can be resolved within statistically meaningful volumes. However, there are several 
arguments as to why the differences in the biases between models may not strongly affect our work. 
First, the initial power spectra of currently favored hierarchical models all have similar slopes 
on galaxy formation scales. Hence, the initial conditions of galaxy formation — and the resulting 
bias — may be similar. In addition, the HDM model, which has a very different initial power 
spectrum, has a similar final correlation function, which suggests that the final power on small 
scales is dominated by non- linear processes which erase initial differences. Any velocity bias which 
arises through dynamical friction should be similar for models evolved to similar clustering levels. 
If velocity bias is related to galaxy formation sites preferentially near potential wells, then velocity 
bias could depend on mass density and the efficacy of this measure could be diminished. However, 
both types of bias appear to be strong only in the most non-linear regions (5 ^ 200) flSummers^ 



-12- 



Davis, & Evrard 1995), while our study focuses on the mildly non-linear regimes (5 ~ 10). 

High resolution, hydrodynamic simulations that include detailed galaxy formation should 
improve our understanding of bias and how it changes from model to model. Intuitively, any bias, 
by eliminating objects in the underdense regions, should have the overall effect of shifting the 
data points in Figure 5 to the left. Higher resolution will also incorporate the interactions of dark 
matter halos, which may significantly effect the three point correlation function on galaxy scales 



(see Bartlett fc Blanchard 1996 ). All of these effects indicate more detailed simulations, beyond 



what is currently available, may be necessary for the actual application of the redshift dispersion. 

The next logical step is to attempt to apply a z to denser redshift surveys than the IRAS 1.2 
Jansky survey. In the mean time, additional studies on larger volume, higher resolution N-body 
simulations would be useful, but perhaps overkill until a suitable redshift survey becomes available. 
Also, it is not clear that using dark matter halos without the proper means for identifying galaxies 
would add to these results. Further exploration should also be done on a wide variety of survey 
geometries. Here, we only looked at a single small sphere. It is quite possible that a larger sphere 
or a shell might be the optimal shape to balance the tradeoff between high density and large 
numbers of clusters that make a z work best. 
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A. Cosmic Virial Theorem 

In §2 we showed that the CVT applied to sets of particles provided it holds for individual 
particles. To prove the latter result we return to Peebles' original derivation (see P76). pavis 



fc Peebles 1977| later rederived the Cosmic Virial Theorem by placing it in the context of the 
BBGKY hierarchy. We choose to use the earlier approach because of its simpler, more intuitive 
nature. The essential derivation is still that found in P76, but a few minor modifications have 
been added to elaborate key steps in the derivation. 

The derivation of the CVT proceeds in the following manner. First, an equation linking the 
velocity, acceleration, and correlation function of a general particle system is obtained from the 
conservation of phase space density. Second, the accelerations are linked through gravity to the 
two and three point correlation functions. Third, the assumptions of a cosmological system are 
applied to the general particle equation to obtain the scaling between the velocity dispersion and 
the density. The key modification that we have made in order to show that this result applies for 
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individual particles in the system is to replace the entire phase space with the phase space of a 
single particle. 



A.l. Phase Space Conservation 

To start, consider a system of particles with the position, velocity, and acceleration of the 
ith particle given by Xj, Vj, and aj. Strictly speaking, these are not phase space variables, 
but represent a three-dimensional single-particle distribution and its evolution in time. Now 
consider any two particles i and j having relative position, velocity, and acceleration: r = Xj — Xj, 
v = Vj — Vj, a = a.j — &i, as shown in Figure 8a. The rate of change in the distance, r, between 
the two particles obeys 

r = v r = r-v/r, (Al) 

and the acceleration 

r= (r-a + v 2 - (r-v/r) 2 ) jr = (r-a+w 2 )/r, (A2) 

where v 2 = v 2 + v 2 . For a sufficiently large system, the number of particles found in a volume of 
phase space centered on the ith particle is dNiyp = fiyp(r,r,r)drdrdf, where fiyp is the phase 
space density of all pairs connected to the particle i. The function fi*p is related to the two point 
correlation function by 

dr / drdrfiyp = 9(1 + ^yp)47rr 2 (ir, (A3) 



where v is the mean number density and ^iyp = £,ixp(r) is the isotropic two point correlation function 
of the ith particle, which is defined with respect to the joint probability dPi = dVidVjV 2 {l + Cixp)- 

For convenience, we now explicitly drop the subscript ixV notation: /j><p — > /, Niyp — > N, 
and —> It should be understood that all these quantities are relative to the ith. particle, 
including the velocities and accelerations. 

Having defined the particle system and the phase space density, /, it is now possible to look 
at the time evolution of the particle within the system. The integral of / links it to £ giving the 
total number of particles with separation between r and r + dr. The number of particles with 
separation less than r is 

N(r, *)= f dr J drdrf. (A4) 
At some later time 8t there exists a 5r such that 

N(r,t) = N(r + Sr,t + St). (A5) 

Without loss of generality, we could have just as well started at the point N(r — Sr, t), and chosen 
5t and 5r to satisfy 

N(r-Sr,t) = N(r,t + St), (A6) 
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which allows us to expand around r and t. Subtracting N(r, t) from the above expression, we have 

- [N(r, t) - N(r - Sr, t)} = N(r, t + St) - N(r, t). (A7) 

The right side can be readily expanded in powers of St and is simply StN + ^St 2 N. Likewise, from 
the definition of N(r, t) the left side is given by 

- [N(r, t) - N(r - Sr, t)] = - f drdr f drf. (A8) 

J Jr—Sr 

The Taylor expansion of the integral gives 

/r' 
drf ~ \5r[f{r) + f(r — Sr)] 

~ Srf — ^5r 2 d r f 

~ Strf + \St 2 {rf -r 2 d r f), (A9) 
where the last step made use of the following expansion for Sr 

Sr = Str + i(5t 2 r + . . . , (A10) 

which is valid for fixed r and f. 

Matching powers of St gives an expression for iV 

N = —J drdr(r — r 2 d r )f 

= —J drdr rf + d r J drdr r 2 f 

= _p 4 7r[r 2 (l+e)(r)-9 r (r 2 (l + 0(r 2 ))], (All) 
where we have used d r r = 0, but d r (r 2 ) / 0. The time derivative of can be written 

N = 4vr^ 2 f drr 2 {I + 0= f drr 2 '(, (A12) 
Jo Jo 

which is combined with the previous expression to give 

N otd r [r 2 (l+0(r 2 )} -r 2 (l+0(r) = f drr 2 't (A13) 

Jo 

Substituting in the expressions for r and r, and expanding the r derivative results in the general 
equation derived in P76 

rd r [(l+0(v 2 r )] + (l+0(2v 2 r -v 2 ) = (l + e)( r .a)+r- 1 f drr 2 'i (A14) 

Jo 

This equation represents the conservation of phase space for a single particle in a system, and 
is true for any particle in the system with a well defined phase space density and an isotropic 
two-point correlation function. 
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A.2. Gravitating Particles 

For gravitationally interacting particles, the acceleration term is simply (r-a) = (r-(gj — Si)), 
where 

» = G E^i?^i3- ( A15 ) 

When averaging over phase space, gj can be written in terms of the two and three point correlation 
functions, which requires considering a third particle at x/% (see Figure 8b). The average force on 
i is the force from j plus the force due to k weighted by the conditional probability of the third 
particle being located at s 

Gmr f s dP 3 

{Si) = ~^- + Gm 

r 6 J s 6 (1P2 

= ^ + ^fd 3 s^[l+ar) + m + a<l) + ar,s,q)} (A16) 

where m is the average mass per particle and the three point correlation function ( is defined by 
the probability 

dP 3 = dVidVjdVkP^l + £(r) + £(s) + C(r,s,q)}. (A17) 

The first, second and third terms in the brackets integrate to zero by isotropy leaving 

(»> = + IT^ / d3s ^(q) + C(r, 8, q)}. (A18) 

Likewise, the force on j is 

<*> = "^r " iTe / d3q ^ {s) + ( A19 ) 

Because £(r, s, (?) = C( r > 9j ■?), the average of the total force is 

(r-(ft- -»)> = - 1^/ ^<£?K(<f) + C(r, *,«)]. (A20) 



A. 3. Cosmological Scalings 

To obtain the scaling of the velocity dispersion we apply various approximations that are 
consistent with a cosmological model. First, we consider the situation where the correlation 
function is assumed to be stationary in comparison to the motions of the particles, i.e., the time 

1/2 

scale for changes in £ is much greater than the crossing time r/(v^) — > £ = 0. In addition, 
v is randomly oriented so that (v 2 }/3 ~ (v%)/2 ~ (u^), and (2^ — v\ ) = 0. These assumptions 
eliminate two terms from Eq. (A14). Combining with Eq. (A20) gives 

rd r [(l +0(v 2 )] = ~ 6Gm(1+6 -6Gp j d^Uq)+ar,s,q)]. (A21) 

T J S 
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The next step is to evaluate the integrals on the right. At this point we impose a model for £ and 
(. Observations indicate that the correlation functions are well modeled by power laws of the form 



£(r) = , 7 = 1.77, r = 5.4/i _1 Mpc (A22) 

and, 

C(r, s, q) = Q[Z(r)£(8) + £(r)£(g) + Q ^ 1.0. (A23) 

Using these relationships, the first integral can be solved with the following identities rs = rs cos 9, 
and q 2 = r 2 — 2srcos0 + s 2 , and by noting that in spherical coordinates d 3 s = s 2 ds sin 9 dOdcj) 

y> s ™£( g ) = J s ^ds S m9dedcp r -^^-i{q) 

= 2tt J dsdOr cos sin £((r 2 - 2sr cos + s 2 ) 1/2 ) 
= 2nr 2 J dy dO cos 9 sin 6>£((r 2 — 2yr 2 cos 6 + y 2 r 



/•OO /*7T 

= 2<r 2 dy - 
Jo Jo (1 

27r£r 2 J 7 



... , ^V^ 1 ^ 

<i# cos sin ( 



(1 -2ycos9 + y 2 p/ 2 



(A24) 



(2- 7 )(4- 7 )- 
where J 7 = J °° dyl 1 (y)/y 2 , y = s/r, and 

J 7 (y) = {y + 1) 2 ^{1 - (2 - 7 )y + y 2 } - \y - 1| 2 ^{1 + (2 - 7 )y + y 2 } (A25) 

For the second integral, we obtain a similar result 

/ d 3 s r -^-((r, s,q) = J d 3 s^Q[£(r)£( S ) + £( r )£( g ) + 

= Q|d 3 s^)[e(r) + e( s )] 



27rQgVM 7 
(2- 7 )(4- 7 )' 



(A26) 



where M 7 = / °° dy(l + y^)I 1 (y)/y 2 . For our purposes, it is sufficient to know that J±.g and Mi. 8 
are of order unity. Inserting the above results into Eq. (A22) gives 

rdUl+Mv 2 )]- 6Gm ( 1+ ^ ^Gpjr 2 J, l2nG P Qer 2 M, 

rd r [(l + 0(1, )j - - ( 2 -7)(4- 7 ) " (2- 7 )(4-7) ' ^ 

Finally, let £ 3> 1. Dividing through by r, and integrating from r to oo and dividing again by £, 
gives an equation for the velocity dispersion 

y„2w . 6Gm 127rGpr 2 J 7 6^GpQ£(r)r 2 M 7 

(7 + l) r + (2- 7 )2(4- 7 ) + ( 7 -l)(2- 7 )(4- 7 ) 1 j 
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For small r the third term dominates the second. The first term is negligible in the limit that m 
is very small, i.e., that the mass is divided up into many tiny individual particles. Thus the third 
term dominates, leaving the classic result 

{ H r) -( 7 -l)(2-7)(4- 7 )- [ ' 

Using the definition H 2 = 8irGp/3£l, we can rewrite the above expression as 

( K ) "4( 7 -l)(2-7)(4-7)- (A30) 

which, if we return to the earlier notation (measuring distances in terms of velocity Hr) gives the 
expression quoted in Eq. (4) of §2 

(^)(r)oc^ xP (r)r 2 , (A31) 

with a proportionality constant 

1_ 4(7-l)(2-7)(4-7)- (A32) 
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Separation, r [km s _1 ] 

Fig. 1. — Two-point correlation function (a) and pairwise velocity dispersion (b) measured in 
N-body simulations of four cosmological models. Each model was normalized to have similar power 
on small scales (i.e., as = 0.67). 
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Fig. 2. — (a) Mean flow fj, v and (b) velocity variance a% as a function of overdensity, n/n — 1, for 
N-body models. These data were computed using 10 5 cells with r = 194 km s _1 and n = 1. 
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Fig. 3. — Empirical fits to (a) fi v and (b) cr 2 , as given by Eqs. (17) and (18). The vertical 
axis has been scaled so that each line of points corresponds to a different cell size. From top 
to bottom the cell sizes are r(km s" 1 ) = [488,388,308,244,194,154,122,97,77] corresponding to 
n(r) = p 4 ^ 3 ^ 2 ^ 1 ^ ^- 1 ^ -2 ^ -3 ^- 4 ]. The horizontal axis is in terms of scaled units ra/n 1 / 3 . 
The lines indicate the value obtained from the empirical fits. 




Fig. 4. — Schematic illustration of the mean redshift, fi z , and the redshift dispersion, a z , in a pie 
slice projection of a cluster. 



-24- 



700 - 



o 600 



ig 500 

06 



"I 1 1 1 1 

• CDM 0=1.0 
OCDM n = 0.35 



"i 1 1 r 



"i 1 1 r 




66660099 



6 



9t9o 



o 



O- 



0" 







5 10 

Angular Overdensity, n/n(0) - 1 



15 



Fig. 5. — Redshift dispersion, a z , as a function of angular overdensity for 8000 dark matter particles 
out to Z = 5000 km s _1 averaged over 27 viewpoints. The error bars are the standard deviation of 
these averages. The angular radius of the cell was 6 = 1.8°, corresponding to n = 2. 
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Fig. 6. — Components of the redshift dispersion, function of angular overdensity for 

0, = 1 CDM. The spatial component represents the dispersion due to the positions of the particles, 
while the peculiar velocity component represents the dispersion from the motions of the particles 
themselves. The measured redshift dispersion is a combination of both components (see Figure 5). 
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Fig. 7.- 



Estimated number of galaxies in each volume limited shell of the SDSS. 



-27- 




Fig. 8. — Schematic drawings depicting positions and velocities of (a) two particle and (b) three 
particle systems used to derive the Cosmic Virial Theorem. 



